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Laboratories working towards accreditation by the International Standards Organization 
(ISO) 15189 standard are required to demonstrate the validity of their analytical methods. 
The different guidelines set by various accreditation organizations make it difficult to pro- 
vide objective evidence that an in-house method is fit for the intended purpose. Besides, 
the required performance characteristics tests and acceptance criteria are not always de- 
tailed. The laboratory must choose the most suitable validation protocol and set the ac- 
ceptance criteria. Therefore, we propose a validation protocol to evaluate the performance 
of an in-house method. As an example, we validated the process for the detection and 
quantification of lead in whole blood by electrothermal absorption spectrometry. The fun- 
damental parameters tested were, selectivity, calibration model, precision, accuracy (and 
uncertainty of measurement), contamination, stability of the sample, reference interval, 
and analytical interference. We have developed a protocol that has been applied success- 
fully to quantify lead in whole blood by electrothermal atomic absorption spectrometry 
(ETAAS). In particular, our method is selective, linear, accurate, and precise, making it 
suitable for use in routine diagnostics. 
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Accreditation is a tool for medical laboratories to demonstrate 
their competence and to ensure the delivery of accurate and re- 
liable results. More than 70 accreditation bodies are recognized 
by the international laboratory accreditation cooperation (www. 
ilac.org). Accreditation for clinical reference laboratories can be 
voluntary like in Finland and Canada or mandatory like in France 
and Korea [1-5]. Clinical laboratories can be accredited by Inter- 
national Standards Organization (ISO) 17025 or ISO 15189. ISO 
15189 specifies the requirements of the quality management 
system for medical laboratories. Validation or verification of each 



test method is one of the requirements for accreditation (ISO 
15189 section 5.5.2) [1]. 

This manuscript compared various international recommen- 
dations to propose a validation protocol for in-house methods. 
This methodology was applied to quantify lead in whole blood 
samples by electrothermal atomic absorption spectrometry 
(ETAAS) [6-11]. 

The validation was performed on an M series electrothermal 
atomic absorption spectrometer (Thermo Scientific, Marietta, 
OH, USA). The temperature program consisted of four stages: 
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drying (95°C, 10 sec and then 110°C, 25 sec), pyrolysis (700°C, 
30 sec), atomization (1,200°C, 4 sec), and cleaning (2,500°C, 
4 sec). The equipment used a graphite tube with integrated 
LVOV platform. The radiation source was a mono-elementary 
hollow cathode lamp (X=283 nm). The Zeeman effect was used 
for background correction. Argon was used as the purge gas. 
All the chemicals used were of analytical reagent grade. Atomic 
absorption spectrometry standard solutions for lead (titrisol 
grades from Merck; Fontenay-sous-Bois, France) were used to 
build the calibration curves. They were prepared from a stock 
solution of 1 g/L by successive dilutions with reverse-osmosis 
purified water (Merck-millipore, Fontenay-sous-Bois, France). 

Each sample was ten-fold diluted in a matrix modifier (2.5 g/L 
of NH4H2PO4 prepared in 0.5% Triton X-100 [v/v], 0.065% 
HNO3 [v/v]). Matrix-spiked calibration was performed. Quantifi- 
cation was performed by the integration of absorbance over the 
time of the transient absorption signal. Internal control samples 
used were the ones that were commercially available at the 
Quebec National Public Health Department (INSPQ). Our labo- 
ratory participated in inter-laboratory comparison program orga- 
nized by INSPQ. This program had 6 cycles per year. 

The fundamental parameters tested were selectivity, calibra- 
tion model, precision, accuracy (and uncertainty of measure- 



ment), contamination, stability, reference interval, analytical sen- 
sitivity, and specificity. The latter two parameters were expressed 
as the detection limit and interference, respectively [6-9]. 

The acceptance or rejection criteria were set prior to valida- 
tion according to published data, specific legislative and regula- 
tory requirements, and professional judgment. To be validated, 
each parameter had to meet the acceptance criteria. Results 
and statistical analysis are detailed in Tables 1 and 2. 

To test selectivity, at least six blank samples were analyzed to 
ensure that there was no interference [7]. The wavelength used 
was specific to lead and allowed the user to distinguish between 
it and other components in the sample prior to quantification. 
No interference was noted. 

The calibration range was between 10 and 1,000 |jg/L. To as- 
sess the relationship between concentration and response, five 
calibration standards were run in duplicate, as recommended by 
the Australian national association of testing authorities (NATA), 
US food and drug administration (FDA), French Normalization 
Association (AFNOR), and Feinberg [7-11]. Five rounds of cali- 
bration were performed [10, 11]. The model was found to be 
linear. 

The least squares regression method was used to establish 
the relationship between the analyte concentration (x) and in- 



Table 1. Statistical tests of calibration model 



ANOVA 



Sources 
Regression residual 



FTest 



Slope existence 



Non linearity 

Repeatability 
J = 5 calibration sets 
1 = 6 calibration standards 
yi,j = signal measured 



Slope=0 

i.e. does not exist 



Goodness of the model Model fits the data 



CTest 



C-test 



I variances are equal 



Sum of squares (ss) 

SSreg 
SSresid 

SSnl — SSresid-SSr 

ssr= sissy, j) 

(e.g. SSyi,j for each calibration 
standard level) 



HI 

Slope ^0 
i.e. slope exists 

Model does not fit the data 



HI 



Degree of freedom (df) 
1 

IJ-1 
1-2 

J (1-1) 



Inference 
Reject HO 

i.e. the slope is something 
other than zero 

Accept HO 

i.e. the model is valid 



Inference 



At least, one variance value is significantly Accept f 
larger than the other variance values 



Mean squares (ms): ss/df 

mSreg 
mSresid 

msni 
msr 



F-statistic 

F=mSreg/mSresid 
F=mSreg/ mSr 

C-statistic 
Max (var (y,j)) 

N 

I (var (y,j)) 
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Table 2. Statistical results 



Linearity 



Slope of the regression model 
Intercept of the regression model 
Correlation coefficient 
Slope existence (Fisher) 
Goodness of the model (Fisher) 
Variances homogeneity (Cochran) 
Repeatability 

QC41pg/L 
QC114 pg/L 
QC 236 pg/L 
QC 532 \\g/l 
Reproducibility 

QC41pg/L 
QC114 pg/L 
QC 236 pg/L 
QC 532 pg/L 
Stability 

QC41pg/L 
QC114 pg/L 
QC 236 pg/L 
QC 532 pg/L 
Lower limit of quantification (LLOQ) 

lOpg/L 
15pg/L 
Uncertainty (%) 

<50 pg/L 
50-200 pg/L 
>200 pg/L 



Mean 
44 
120 
241 
539 

Mean 
37 
116 
249 
545 

Mean 
34 
116 
254 
565 

Mean 
9.8 
15.3 

Mean 
44 
124 
343 



Results 
0.00543 
0.07776 
0.989 
1053 (HS) 
0.039 (NS) 
0.328 (NS) 



Acceptance criteria 



References 



CV (%) 
7.6 
1.3 
1.4 
1.1 

CV (%) 
15.2 

3.3 

3.5 

2.9 

CV (%) 
9.2 
2.1 
2.6 
2.5 

CV (%) 
14.9 
15.1 

CV (%) 

38.5 

10.7 



F (0.05; 1; 23) = 4.49 
F(0.05; 5; 20) = 3.26 
C(0.05; 5; 4) = 0.616 

Acceptance criteria 
<7.6% 
<4.5% 
<4.5% 
<4.5% 

Acceptance criteria 
<20% 
<15% 
<10% 
<10% 

Acceptance criteria 
<20% 
<15% 
<10% 
<10% 

Acceptance criteria 
<20% 
<20% 

Acceptance criteria 
<40% 
<30% 
<20% 



7,10,11 
7,10,11 
7,10,11 

References 
9, 13 
9, 13 
9, 13 
9, 13 

References 
9, 12 
9, 12 
9, 12 
9, 12 

References 
7,9 
7,9 
7,9 
7,9 

References 

12,15 
12,15 

References 
12 
12 
12 



Abbreviations: HS, highly significant; NS, not significant; F, Fischer value; C, Cochran value. 



strument response (y), and to assess the validity of the calibra- 
tion model. Slope, intercept and coefficient of determination of 
the regression model were calculated. Since a high coefficient 
of determination did not guarantee that the model fit the data, 
the existence of a slope and the goodness of the model were 



tested. These tests were based on a Fisher's statistical test, 
which compared the variance of two different set of values. In 
order to evaluate each hypothesis, a research hypothesis (HI) 
and a null hypothesis (HO) were framed. The null hypothesis 
was rejected if the F-distribution calculated from the data was 
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greater than the critical value of the F-distribution for the desired 
false-rejection probability (0.05). 

The hypothesis and Formula are shown in Table 1. Two 
sources of variation were identified and used: regression error 
(ssreg) and residual error (ssresid), which was composed of non- 
linearity error and repeatability error (Table 1). Variances in ho- 
mogeneity of the five curves were evaluated by using Cochran's 
C Test, which was a one-sided upper limit variance outlier test 
(Table 1). As described previously, the null hypothesis, HO was 
tested against an alternative hypothesis, HI (Table 1). The up- 
per limit critical value (Ctheo) was chosen according to the de- 
sired significance level (a), the number of data series consid- 
ered (N), and the number of data points (n) per data series. If C 
exceeded Ctheo, HO was rejected, i.e variances were not equal 
(Table 1). These statistical tests confirmed the existence of a 
slope, the goodness of the model, and the homogeneity of the 
five calibrations curves. The deviation of standards from nomi- 
nal concentration was also calculated and it did not exceed 
15%. Hence, the model was acceptable. 

The lower limit of quantification (LLOQ) was fixed at 10 |jg/L. 
According to published data, a LLOQ between 10 and 15 jjg/L 
was considered normal [12-15]. 

It was established using five samples independent of stan- 
dards. Precision was within 20% of the coefficient variation. 
Precision included within-batch run or repeatability and be- 
tween-batch run or reproducibility. 

Analyte recovery lied within 80 and 120%, which was in the 
acceptable range. To assess repeatability, 4 controls (QC1=41 
Mg/L, QC2 = 114 Mg/L, QC3 = 236 Mg/U and QC4 = 532 \jg/l) 
were measured 30 times during a single analytical run. Repro- 
ducibility was studied for 20 days with 30 determinations per 
concentration of the same controls with a maximum of two runs 
per day. Intra-assay and inter-assay imprecision were evaluated 
as requested by French regulation authorities, and were less 
than 10% [12]. It was also possible to compare the perfor- 
mances of the method to published data or proficiency test re- 
sults [13-15]. On the contrary, biological imprecision criteria 
published by Vassault et al. [16] and Minchinela et al. [17] 
could not be used, as exogenous substances like lead were not 
included in their databases. 

Contamination was assessed using the protocol described in 
SH GTA 04 by the French Committee for accreditation (CO- 
FRAC) [11]. A high-level quality control (QCH 532 |jg/L) was 
analyzed in triplicate (QCHl, QCH2, and QCH3) and a low-level 
quality control (QCL 12 |jg/L) was also analyzed in triplicate 
(QCLl, QCL2, and QCL3). Five replicates were performed and 



the percentage of contamination was calculated using the for- 
mula (mean [QCH/QCL3-QCLl]xl00). It was less than 1% and 
within the acceptable range. 

The stability was evaluated on eight aliquots of each QC main- 
tained at the routine storage conditions (4°C) and during the in- 
tended storage period (14 days). The results were compared to 
the mean of the back-calculated values for the standards at the 
appropriate concentrations from the first day of long-term stabil- 
ity testing [7-9]. The mean values of analyte concentration were 
within ±10% of the nominal value and the CV did not exceed 
the acceptance criteria [9]. 

Uncertainty of the method was estimated via the ISO guides 
to the expression of uncertainty in measurement (GUM) ap- 
proach based on modeling the measurement and evaluating all 
sources of uncertainty [18, 19]. The uncertainty in patient result 
included both pre-analytical and analytical variations. Sources of 
pre-analytical uncertainty such as sample collection, handling, 
transport, and storage were difficult to estimate. When standard- 
ized procedures, such as guidelines for sample collection were 
followed, pre-analytical variations might be minimized [20]. 
Hence, to estimate uncertainty, we mainly considered the ana- 
lytical variations. Two components were identified as standard 
sources of uncertainty: long-term precision (intra-laboratory re- 
producibility) and uncertainty of the calibration (estimated using 
bias). All identified sources were combined by "error-propaga- 
tion" formulas to yield the standard uncertainty of the measure- 
ment results, which was multiplied by a factor of 2 to obtain the 
uncertainty. Estimates of measurement uncertainty showed re- 
sults that were acceptable by COFRAC standards [12] (Table 2). 
Reference interval for acceptable levels in human body was not 
verified since the acceptable levels were defined by various na- 
tional authorities. 

To conclude, all the parameters tested satisfied our accep- 
tance criteria. The validity of this method was established. This 
protocol seemed appropriate to guide laboratories in performing 
in-house methods validation like ETAAS. Validation allowed en- 
suring the appropriateness of test results. After initial validation, 
quality standards had to be maintained. ISO 15189 recom- 
mended the development of a quality control system to verify 
the intended quality of results. Medical laboratories have to de- 
velop continuous monitoring of assay performances [1, 6-9]. 
The laboratory should not get out-of-range results and our re- 
sults were acceptable. It could also include, for instance, an in- 
ternal control system, preventive maintenance of equipment, 
staff education and training, and internal audits [2, 3, 6]. 
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